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(54) A data processing system for providing user load levelling in a network. 



00 

CO 

o 

00 

s 



Q. 

UJ 



(5?) The present invention provides a data proces- 
sing system, and method of operating such a 
system, for facilitating a connection of a prog- 
ram on a client computer to a server, the server 
consisting of a plurality of server computers 
with shared resources. The data processing 
system, the client computer, and the server 
computers all reside in a logical network. The 
data processing system has an input means for 
receiving a request from the client computer for 
a machine address of a server computer iden- 
tified by a server computer name sent with the 
request, such a machine address enabling a 
connection to be made from the client com- 
puter to that server computer via the network. A 
storage device is provided by the system for 
storing a list identifying server computer names 
with machine addresses of the server compu- 
ters. A conversion means in the system uses the 
list to convert the server computer name re- 
ceived by the input means into the machine 
address of the server computer, and then an 
output means sends the machine address from 
the conversion means to the client computer. 
The system is characterised by decision logic 
for studying the server computers at predeter- 
mined intervals having regard to a predeter- 
mined test criteria, in order to select one of the 
server computers; and writing means for up- 
dating the list by associating the machine ad- 
dress for the server computer selected by the 
decision logic with a particular server computer 
name contained as a generic server computer 



name in the list Using this technique, when a 
client computer specifies the generic server 
computer name, it receives the machine 
address of the server computer identified by the 
decision logic. 
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The present invention relates to a data process- 
ing system for facilitating the connection of a program 
on a client computer to a server, the server consisting 
of a plurality of server computers with shared resourc- 
es. 

The data processing system, the client computer, 
and the server computers are ail resident on a net- 
work. This network need not be one physical network 
such as a Local Area Network (LAN); for instance it 
may consist of a number of LANS or WANS (wide area 
networks) connected together (eg. via 'bridges') to 
form a single logical network. However the same net- 
work protocol will be employed throughout the net- 
work, a typical example of such a protocol being 
TCP/IP which will be familiar to those skilled in the art. 

In many environments it is increasingly found that 
a number of server computers are connected togeth- 
er using some form of network, eg. a LAN. It is often 
the case that several users of client computers will be 
connected to one such server computer, whilst other 
server computers stand idle. An example of this is 
when such computers are situated in workers' offic- 
es, connected by, for example, a token ring LAN using 
the standard TCP/IP network protocol. When workers 
are away from their offices, their computers will usu- 
ally stand idle. 

In such situations it is commonly the case that a 
few of the computers in the network are heavily load- 
ed, whilst other computers in the network are very 
lightly loaded, giving poor response and performance 
for the client computers using the heavily loaded ser- 
ver computers. Hence there is a problem of how to en- 
able the client user load to be spread more evenly 
across the available computing resources of the ser- 
ver in a manner which is transparent to the client com- 
puter and its programs. If transparency is to be ach- 
ieved, standard protocols need to be observed in or- 
der that client computers can use a variety of connec- 
tion methods without modification of any programs 
being required. 

A prior art technique which has been developed 
to provide some sort of load spreading is called "Static 
load levelling". With this technique each application 
on each client computer has a designated server to 
which it always connects. Hence, for example, if there 
are 200 potential clients of a server having five server 
computers, a pre-specified group of, say, 40 of the cli- 
ents will be told (or configured) to always connect to 
machine 1 , etc. On average it may be argued that this 
will give a reasonably even load across all of the ser- 
ver computers. However in practice it is often the 
case that, using this technique, a large number of 
users of client computers are connected to one server 
computer, while an adjacent server computer stands 
completely idle (eg. given the above example there 
could easily be 40 users on one server computer 
whilst the other 4 server computers stand idle). 
Hence in situations where the user loading changes 



from time to time, the prior art static load levelling 
technique is not particularly satisfactory. What is re- 
quired in such instances is a more 'dynamic' techni- 
que which can respond to changing user loads, and 

5 thus direct new users to the most suitable (eg. least 
heavily loaded) server computers in the server. 

Other prior art techniques can be found in other 
environments, such as those where job allocation is 
an issue. For instance in "batch processing", a client 

10 computer submits an encapsulated task to a central 
server, which determines which one of several dossh 
ble servers is quiet enough to be able to handle the 
task. The task is sent to that server, is processed, and 
the results are then sent back to the client (e.g. as a 

15 results file, or by electronic mail). With a batch proc- 
essing system, there is a brief connection to the cen- 
tral server while the job is transferred from the client 
to the server allocated by the central server. After this 
brief connection the client disconnects, and has no 

20 more interaction with the submitted task until it has 
been completed, and the results have been passed 
back to the client by some means. 

However in the situation with which we are cur- 
rently concerned, the dynamic load levelling techni- 

25 que that is required must be able to deal with "inter- 
active" sessions. Rather like a phone conversation, 
the connection between the program on the client 
computer and a particular server computer will persist 
for the duration of the "conversation" session. Hence 

30 the batch processing concept is inappropriate in the 
present situation. 

It is possible to write some specific code within a 
program on a client computer which contains internal 
message-passing systems to route work from that di- 

35 ent program to a corresponding server program. Such 
systems are dedicated only to that particular client 
program, and the connection and load-levelling meth- 
ods are not accessible to other client-server applica- 
tions. Often, such systems operate by the client con- 

40 necting to a specific "host" server computer, and from 
there the work will be sent to another server for proc- 
essing. Clearly this technique can result in large bot- 
tlenecks arising at the "host" server computer. 
Hence such a technique is not suitable in the 

45 present situation since it only supports one very spe- 
cific type of client-server connection, whereas we 
need a technique that will allow any client-server con- 
nection method using the network protocol to be con- 
nected to a quiet server in a way that is completely 

so transparent to the client program. Further the above 
technique relies on an initial connection to the 'cen- 
tral' host server computer, which then passes the re- 
quest on to another server computer; as described 
above this can potentially create a serious bottle- 

55 neck. 

It is an object of the present invention to provide 
a technique which facilitates a connection between a 
client program and a server computer on a server in 
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a way that takes into account the current status of the 
server computers forming the server. This technique 
must observe standard network protocols and should 
operate in a manner which is transparent to the client 
program requesting access. 5 

Accordingly the present invention provides a data 
processing system for facilitating a connection of a 
program on a client computer to a server, the server 
consisting of a plurality of server computers with 
shared resources, the data processing system, the w 
client computer, and the server computers residing in 
a network, the system comprising: input means for re- 
ceiving a request from the client computer for a ma- 
chine address of a server computer identified by a 
server computer name sent with the request, such a 15 
machine address enabling a connection to be made 
from the client computer to that server computer via 
the network; a storage device for storing a list identi- 
fying server computer names with machine address- 
es of the server computers; conversion means for us- 20 
ing the list to convert the server computer name re- 
ceived by the input means into the machine address 
of the server computer; output means for sending the 
machine address from the conversion means to the 
client computer; the system being characterised by: 25 
decision logic for studying the server computers at 
predetermined intervals having regard to predeter- 
mined test criteria, in order to select one of the server 
computers; and writing means for updating the list by 
associating the machine address for the server com- 30 
puter selected by the decision logic with a particular 
server computer name contained as a generic server 
computer name in the list; whereby when a client com- 
puter specifies the generic server computer name, it 
receives the machine address of the server computer 35 
identified by the decision logic. 

Typically the conversion means will access the 
list from a local piece of storage, the data processing 
system having a copier to copy the list from the stor- 
age device to that piece of memory. In preferred em- 40 
bodiments the data processing system further com- 
prises a messaging means, responsive to the updat- 
ing of the list by the writing means, for sending a mes- 
sage to the copier requesting the copier to copy the 
updated list into the piece of local memory. 45 

Any manner of predetermined test criteria can be 
used in the data processing system of the invention, 
for example the amount of idle processor time, the 
number of processes running, the amount of free 
memory, the load average", etc. However in prefer- so 
red embodiments the predetermined test criteria are 
such that the decision logic identifies the server com- 
puter having the least number of client programs log- 
ged on to it 

In preferred embodiments the predetermined in- 55 
tervals are variable and will either be set by a user of 
the system, eg. the system administrator, or will be 
adjusted dynamically. The user will also set the pre- 



determined test criteria to be used by the decision log- 
ic. 

Further in preferred embodiments the user can 
limit the number of server computers which the deci- 
sion logic studies. This may be useful if, for instance, 
some of the server computers have not got access to 
all of the resources that other server computers have 
access to, and so would not be suitable as server 
computers to be associated with the generic server 
computer name. 

In some embodiments it may be advantageous to 
use a plurality of generic names. Each server name 
would then have a number of server computers 
whose machine addresses are associated with that 
generic name, the decision logic employing different 
sets of predetermined test criteria for each generic 
name. In such embodiments one or more of the ser- 
ver computers can be associated with a plurality of 
the generic names. 

Viewed from a second aspect the present inven- 
tion provides a method of operating a data processing 
system to facilitate a connection of a program on a cli- 
ent computer to a server, the server consisting of a 
plurality of server computers with shared resources, 
the data processing system, the client computer, and 
the server computers residing in a network, the meth- 
od comprising the steps of: (a) receiving a request 
from the client computer for a machine address of a 
server computer identified by a server computer 
name sent with the request, such a machine address 
enabling a connection to be made from the client 
computer to that server computer via the network; (b) 
storing a list identifying server computer names with 
machine addresses of the server computers in a stor- 
age device; (c) converting, with reference to the list, 
the server computer name received at step (a) into 
the machine address of the server computer; (d) 
sending the machine address identified at step (c) to 
the client computer; the method being characterised 
by the steps of: (e) employing decision logic to study 
the server computers at predetermined intervals hav- 
ing regard to predetermined test criteria, in order to 
select one of the server computers; and (f) updating 
the list by associating the machine address for the 
server computer selected by the decision logic with a 
particular server computer name contained as a gen- 
eric server computer name in the list; whereby when 
a client computer specifies the generic server com- 
puter name at step (a), it receives the machine ad- 
dress of the server computer identified by the deci- 
sion logic. 

The present invention will be described further, 
by way of example only, with reference to an embodi- 
ment therof as illustrated in the accompanying draw- 
ings, in which: 

Figure 1 is a block diagram illustrating the data 

processing system of the preferred embodiment; 

Figure 2 is a flow diagram illustrating how the de- 
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ciston logic in the data processing system of the 
preferred embodiment operates; and 
Figure 3 illustrates a particular embodiment 
where two generic computer names are used. 
In the preferred embodiment we will consider the 5 
situation where the server in question is a high per- 
formance database server which has its data distrib- 
uted across a network of server computers, this ser- 
ver network hereafter being referred to as a cluster. 
Database applications being run by users on client 10 
computers are required to connect to one of the ser- 
ver computers in the cluster to enable them to access 
the data in the database server. By the nature of the 
database system, it does not matter which server 
computer the client connects to - ail of the data is ac- 15 
cessible from any server computer in the cluster. In 
the preferred embodiment the server computers and 
the client computers are all interconnected using 
TCP/IP on a token ring Local Area Network. 

For a large number of users, it is highly desirable 20 
to have a number of users on each of the server com- 
puters in the server cluster, rather than all users con- 
necting to (and hence overloading) just one or a few 
of the server computers. With a widely varying user 
workload profile for the database server, this problem 25 
can only be solved by providing some form of load 
levelling" process, which will allocate new client appli- 
cation instances to server computers in the cluster 
that are most suitable for the client connection (eg. 
because they are more lightly loaded than other ser- 30 
ver computers). Clearly this process must be dynam- 
ic, able to respond to changing load conditions over 
time. Since the database applications on the client 
computers are typically complex and often are sup- 
plied only in object code form, it would be very difficult 35 
(or impossible) for the system administrator to alter 
them, and so it is essential that this allocation is done 
in a manner which is entirely transparent to the client 
application. 

The manner in which the data processing system 40 
of the preferred embodiment solves the above prob- 
lems will now be described with reference to Figure 1 . 

Each client computer in a network using the 
TCP/IP protocol (there will typically be many such cli- 
ent computers) will have been informed by the net- 45 
work administrator that it is to communicate with a 
particular computer when it wishes to convert a com- 
puter name of another computer in the network into a 
machine address. When utilising the present inven- 
tion that computer will be the data processing system 50 
of the preferred embodiment. 

Hence when a program running on a client com- 
puter 20 (for clarity, only one client computer is illu- 
strated) wishes to obtain access to a server computer 
(40, 50, 60) in the cluster it will communicate with the 55 
data processing system 10 in order to obtain a full In- 
ternet machine address for the desired server (Inter- 
net addressing is part of the TCP/IP protocol). With 



the prior art technique the client computer would 
specify a server computer name in this communica- 
tion that was specific to one particular server comput- 
er in the cluster. The input means 30 of the data proc- 
essing system 10 would receive this server computer 
name and pass it to the conversion means 70. 

In a storage device 80 of the data processing sys- 
tem a list is maintained which identifies server com- 
puter names with particular Internet addresses. 
When the conversion means is initiated the copier 90 
copies this list from the storage device 80 into a piece 
of local memory 100 accessible by the conversion 
means 70. Hence the conversion means will access 
the list in memory 100 to find the Internet address of 
the computer associated with the server computer 
name passed to it by the input means 30. This Internet 
address will then be provided by the conversion 
means to the output means 110 for transmission back 
to the client computer 20. 

Once the client computer has the Internet ad- 
dress it can then make direct contact with the server 
computer residing at the Internet address provided; in 
Figure 1 this is server computer 2. Since the TCP/IP 
protocol is used any of the access methods that use 
this protocol can be used to access the server com- 
puter. 

When using the data processing system of the 
preferred embodiment the program running on the cli- 
ent computer 20 will not use the server computer 
name that it previously used. Instead a generic server 
computer name will be used. This generic server 
name will either have been placed in the program's 
configuration file, or alternatively the user of the pro- 
gram will specify the generic name when running the 
program. 

Within the data processing system, decision logic 
120 is provided which periodically studies the server 
computers in the cluster having regard to some pre- 
determined test criteria, hereafter called the metric 
string. In the preferred embodiment the metric string 
is a list of questions which when answered by the va- 
rious server computers will enable the decision logic 
to decide which server computer is most suitable for 
a client connection (the most suitable perhaps being 
the least heavily loaded server computer). The metric 
string can be altered as the system administrator 
deems appropriate, depending on what criteria the 
administrator wishes to be used to select a server 
computer. 

In the preferred embodiment the decision logic 
actually sets up a number of child processes, each 
one being responsible for sending the metric string to 
a particular server computer and receiving the re- 
sponse from the server computer. 

Once the responses have been received the de- 
cision logic will collate the responses, decide which 
server computer is most suitable, and then request 
the writing means 1 60 to pass the internet address of 
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that server computer to the storage device for asso- 
ciation with the generic server computer name. If 
however the most suitable server computer is the 
same server computer as that identified in the last 
iteration of the process then there is no need to up- 5 
date the storage device and the writing means wilt not 
be activated. 

Once any necessary update to the storage de- 
vice 80 has been made the messaging means 1 70 will 
notify the copier 90 so that the copier updates the lo- w 
cat memory 1 00 with the new list as stored in the stor- 
age device 80. 

Hence when the client computer requests a ma- 
chine address for the generic server computer name 
the conversion means 70 accesses the list in memory 15 
1 00 and identifies a machine address just as it would 
if any other server computer name had been given. 
However in this instance the machine address actual- 
ly relates to the server computer in the cluster which 
has been identified by the decision logic as the most 20 
suitable (eg. least heavily loaded). When this ma- 
chine address is passed back to the client computer 
20 via the output means 11 0, the client computer will 
proceed to automatically access the server which is 
most suitable. 25 

By this approach it will be seen that a dynamic 
load levelling facility is provided which is completely 
transparent to the client program. As far as the pro- 
gram is concerned it is requesting a machine address 
as normal and is using one of the normal TCP/IP ac- 30 
cess methods to gain access to the server computer 
allocated to it 

In many of todays computing environments (eg 
Unix, AJX (Unix is a Trade Mark of Unix Systems Lab- 
oratories Inc)) an application is provided to perform 35 
the standard name resolution service (ie receipt of a 
computer name and conversion of that computer 
name into a full Internet address). This application is 
commonly known as a "nameserver" application, and 
is installed on one or more computers in the logical 40 
network. Every other computer in the network is told 
to communicate with a specified one of these 'name- 
server' computers when it wishes to determine an In- 
ternet address for any other computer in the network. 
Hence a nameserver computer provides a resolution 45 
service to client computers by receiving from them a 
convenient name given to a particular computer (eg. 
abc.def.ghi.com), and converting it into a full Internet 
address (eg. 29.1.19.66). This Internet address is 
then used by the routing subsystem (TCP/IP) to allow 50 
a client user or application access to the physical 
computer (eg "abc" in this example). 

In the above example of a computer name, "abc" 
is the physical machine, "def is typically the site lo- 
cation, "ghi" the organisation, and "com" one of the In- 55 
ternet classes (three such classes are (com)mercial, 
(edu)cation, (mil)itary). Domains and sub-domains 
can also be added as part of this computer name. Ba- 



sically the name takes a hierarchical form, with the 
finest resolution at the beginning and the coarsest re- 
solution at the end; this type of naming structure will 
of course be well known to those skilled in the art. 

All TCP/IP-based applications, including remote- 
login, remote-shell, telnet, ftp, and also client-server 
applications (such as database applications), are 
aware of the nameserver facility, and will automati- 
cally go to the designated nameserver computer to 
ask for resolution of a computer name into an Internet 
address before attempting to make a connection to 
another computer in the network. 

If we consider Figure 1 again, the standard name- 
server facility will include the following elements: the 
input means 30, the conversion means 70 with asso- 
ciated memory 100, the output means 110, the list 
stored in the storage device 80, and the copier 90. 

The nameserver application is a "daemon" (back- 
ground) process which runs on the data processing 
system; this data processing system may (but need 
not) be one of the server computers forming part of 
the cluster over which users are to be distributed. In 
Unix-type operating systems (eg. AJX by IBM Corpor- 
ation, Ultrix by Digital Equipment Corporation, OSF/1 
by the Open Software Foundation, and HP-UX by 
Hewlett Packard, etc) this daemon process is called 
"named" (name-daemon), and when it is initialised, it 
reads a special database file {named. data) stored on 
the storage device 80 to obtain details of the comput- 
er names about which it is expected to know (over 
which it has "authority"), and the corresponding Inter- 
net addresses ("dotted decimal", e.g. 29.1.19.66) for 
each computer name. Whilst the name daemon is op- 
erating, it can be forced to re-read the information 
from the named.data database file by the sending of 
an inter-process signal to the name daemon process 
telling it to update its internal tables 100 from the da- 
tabase file (named.data). 

In the preferred embodiment of the present inven- 
tion we provide a further facility which runs on the 
same computer as the nameserver application 
("named"), and interfaces with it. A 'generic* comput- 
er name is introduced into the database file 
(named.data), which refers not to one specific com- 
puter, but to any one of a number of computers offer- 
ing equivalent functionality. For example, the generic 
name might be "server.cluster.def.ghi.com"; a client 
program requesting a connection to 'server.cluster' is 
requesting connection to any one of the computers in 
the server cluster. 

The further facility provided by the preferred em- 
bodiment will be referred to hereafter as the "User 
Load Leveller" (ULL) application. This application is 
responsible for deciding which server computer in the 
cluster is currently the least heavily loaded, according 
to some appropriate metric, and for conveying this in- 
formation to the nameserver application. Then sub- 
sequent requests for resolution of the generic server 
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computer name to an Internet address result in the 
nameserver application sending back to the client 
computer the Internet address of the server computer 
which has been deemed to be the most appropriate 
server computer for connection at that point in time. s 

The ULL application consists of the following ele- 
ments f rom figure 1 : the decision logic 1 20 with child 
processes 130, 140, 150; the writing means 160; and 
the messaging means 170. As described earlier with 
reference to Figure 1 the ULL application periodically w 
(at a frequency which can varied (eg. tuned by a sys- 
tem administrator or dynamically adjusted)) polls the 
server computers in the cluster to determine how 
"busy" in some sense they are. The metric used may 
vary, depending on the type of work which is being 15 
handled by the cluster, but may for example include 
the number of login sessions, number of application 
instances running, number of idle cpu cycles since the 
last poll, etc. The metric can be altered to ensure that 
it is appropriate to a specific situation. 20 

Based on the results of this polling, and taking 
into account the situation where a server computer in 
the cluster is too busy to respond to the status request 
within a certain number of seconds, the ULL applica- 
tion decides which machine is currently the least 25 
heavily loaded. The ULL application then modifies 
the database file (named.data) to associate the gen- 
eric cluster machine name with the Internet address 
of this least heavily loaded machine, and sends the 
special inter-process signal via the messaging means 30 
170 which tells the nameserver application to re-read 
its database file. The nameserver application will 
then, in response to a name resolution request from 
a client program, resolve the generic server computer 
name into the Internet address of the most appropri- 35 
ate server computer in the cluster for the client pro- 
gram to connect to. 

The process carried out by the decision logic 120 
of the preferred embodiment will now be described in 
more detail with reference to Figure 2. At step 200 the 40 
ULL application is initialised. A number of steps are 
carried out at initialisation. For example the applica- 
tion: checks for multiple copies of the ULL application 
in memory; cleans up from a previous run of the ap- 
plication (by freeing up system resources such as 45 
memory, locks and semaphores still held in the name 
of the previous instance of the ULL application); and 
locates the nameserver application (named) and its 
data file (named.data). The ULL application then 
parses its configuration file to read information de- 50 
fined by the system administrator, such as the metric 
strings, poll periods, identities of server computers in 
the cluster, etc. Further the ULL application generates 
a number of "child 0 processes - one per server com- 
puter - which are each responsible for polling the ao 55 
tivity of one designated server. 

Once the initialisation has been completed the 
process enters a main loop which executes until the 



ULL application is terminated. At ste ( ps 210, 220, 230 
and 240 the child processes send a metric string (as 
defined by the system administrator) to each server 
computer in the cluster, await responses from those 
computers, and then wait for a trigger signal from the 
main ULL application. 

Once the trigger signal has been sent by the main 
application the responses are sent by the child proc- 
esses to the main application. The main application 
then collates the activity results received from the 
child processes (step 250), and based on predeter- 
mined test criteria identifies the most appropriate ser- 
ver computer (the "least busy" server computer) at 
step 260. At step 270 it is determined whether the ser- 
ver computer identified at step 260 differs from the 
current nominated server computer. If it does then the 
process advances to step 290, at which point the 
nameserver's data file (named.data) is modified. 
Further at step 300 a notification signal is sent to the 
nameserver application (named) to tell it to update its 
internal information from the data file. 

The process then proceeds to step 280. If at step 
270 it is determined that the server computer identi- 
fied at step 260 is the same as the current nominated 
server computer then the process moves straight to 
step 280 without steps 290 and 300 being carried out 
Writing to and reading from the data file are time con- 
suming activities and so steps 290 and 300 should 
only be performed when necessary (ie when the 
"least busy" server computer changes). 

At step 280 the process waits until the end of the 
"poll period". This period is the predetermined interval 
(as defined by the system administrator) between 
successive studies of the server computers by the 
ULL application. Once the poll period has expired the 
process loops back to steps 210-240 and the main 
loop is repeated. 

Having discussed the preferred embodiment a 
few possible alterations will now be discussed. Firstly 
more than one generic server computer name can be 
added to the list in storage device 80 (the named.data 
file). Each generic name could be associated with a 
particular group of server computers, these groups 
being either completely separate or having a few ser- 
ver computers common to a plurality of the groups. 
Indeed one group may be a subset of another group. 
As an example consider Figure 3. A server cluster 410 
comprises eight server computers 400. All eight ser- 
ver computers have access to a main body of data, 
but only four of them have access to some further 
(possibly more confidential) data. 

In this situation two generic names could be gen- 
erated, eg. "general.cluster" and "specific.cluster". 
Any one of the eight computers (enclosed by ring 430) 
can be associated with the former generic name, but 
only the four enclosed by ring 420 can be associated 
with the latter generic name, since only those four 
have access to the further (confidential) data. 
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The system administrator can then set up the 
metric string to be used when studying ail eight server 
computers, or when studying only the four in ring 420; 
the metric string could be the same in both instances 
but need not be. If a client application needs access 5 
to the confidential information then it would request 
access to "specif ic.cluster", but if an application only 
needed access to the general information then it 
would request access to "general. cluster". 

By this approach an application which only needs w 
access to the general information will always be con- 
nected to the least busy server computer, whilst an 
application which needs access to the further (confi- 
dential) information will be given the machine address 
of the least busy server computer that can actually 15 
provide the necessary service; this may or may not be 
the least busy server computer in the network. 

In preferred embodiments a further feature is 
provided to enable the decision logic to temporarily 
implement a "round-robin" metric instead of the 20 
above described 'studying' process. The round-robin 
principle will be familiar to those skilled in the art; ba- 
sically when a client application requests access to a 
server computer it is assigned a particular server 
computer, and when the next request is received then 25 
that application is assigned the next server computer 
in the cluster, and so on. In this way the server com- 
puters are rotated so that each successive server ac- 
cess is made on a different server computer to the 
previous server access. Alternatively the server com- 30 
puters can be rotated at fixed time intervals rather 
than after each access request. 

Although the round-robin technique does not 
have regard to the loading on any of the server com- 
puters, and so there is no determination of the least 35 
busy server computer, there are certain situations 
(eg. where there are lots of client applications which 
only take a short amount of database connection 
time) where a round-robin approach is acceptable. To 
implement the round robin approach the decision log- 40 
ic 120 would ask the writing means 160 to update the 
storage device 80 after each access request has 
been handled (or at fixed time intervals if the alterna- 
tive approach is used), so that the generic name is al- 
ways associated with successive server computers in 45 
the cluster in turn. 

From the above description it will be clear that the 
system of the preferred embodiment has a number of 
advantages. Firstly the technique dynamically allo- 
cates new client users and applications to the server 50 
computer which is least heavily loaded at the time 
they make the connection, thus ensuring an even dis- 
tribution of users and applications across all of the 
available server computers. The client computer only 
briefly contacts the data processing system of the 55 
preferred embodiment to resolve the generic comput- 
er name into a machine address. Completely stan- 
dard access methods (eg. as provided by TCP/IP) are 



then used to make the connection, thus avoiding any 
proprietary protocols or any need to modify access 
methods or applications, and so providing fully trans- 
parent user load levelling. 

Secondly the technique of the preferred embodi- 
ment does not involve any modification to the name- 
server code - the User Load Leveller application in- 
terfaces with the standard code (eg. "named" as ship- 
ped with the unix/AlX operating system). It would be 
possible to provide similarf unctionality to that descri- 
bed here by producing a modified version of the 
nameserver code. However, avoiding this brings ma- 
jor advantages from both marketing and maintenance 
points of view. 

Further the technique can be operated without re- 
quiring any modification to the server computers. 
They are accessed in a standard way after the gen- 
eric server computer name has been used to provide 
the client computer with a machine address. 

Another advantage is that the key parameters, 
such as the time interval between polls of the server 
computers in the cluster and the metric used to deter- 
mine which server computer is least heavily loaded, 
can be altered and tuned by a local system adminis- 
trator, allowing the system to be optimised for a par- 
ticular situation. 

The above described ULL application could be 
supplied as a separate tool to enhance the useability 
of parallel and distributed systems, or could be ship- 
ped with the nameserver application. 



Claims 

1. A data processing system for facilitating a con- 
nection of a program on a client computer to a 
server, the server consisting of a plurality of ser- 
ver computers with shared resources, the data 
processing system, the client computer, and the 
server computers residing in a network, the sys- 
tem comprising: 

input means for receiving a request from the cli- 
ent computer for a machine address of a server 
computer identified by a server computer name 
sent with the request, such a machine address 
enabling a connection to be made from the client 
computer to that server computer via the net- 
work; 

a storage device for storing a list identifying ser- 
ver computer names with machine addresses of 
the server computers; 

conversion means for using the list to convert the 
server computer name received by the input 
means into the machine address of the server 
computer; 

output means for sending the machine address 
from the conversion means to the client comput- 
er 
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the system being characterised by: 
decision logic for studying the server computers 
at predetermined intervals having regard to pre- 
determined test criteria, in order to select one of 
the server computers; and 5 
writing means for updating the list by associating 
the machine address for the server computer se- 
lected by the decision logic with a particular ser- 
ver computer name contained as a generic server 
computer name in the list; w 
whereby when a client computer specifies the 
generic server computer name, it receives the 
machine address of the server computer identi- 
fied by the decision logic. 

15 

2. A system as claimed in Claim 1 further compris- 
ing: 

a copier to copy the list from the storage 
device to a piece of memory accessible by the 
conversion means; 20 

a messaging means, responsive to the up- 
dating of the list by the writing means, for sending 
a message to the copier requesting the copier to 
copy the updated list into the piece of local mem- 
ory. 25 

3. A system as claimed in Claim 1 or Claim 2, where- 
in the predetermined test criteria are such that the 
decision logic identifies the server computer hav- 
ing the least number of client programs logged on 30 
toil 

4. A system as claimed in any of claims 1 to 3, 
wherein the predetermined intervals are variable. 

35 

5. A system as claimed in any preceding claim, 
wherein the predetermined test criteria are set by 
a user of the system. 

6. A system as claimed in any preceding claim, 40 
wherein the user can limit the number of server 
computers which the decision logic studies. 

7. A system as claimed in any preceding claim 
wherein a plurality of generic names are used, 
each one having a number of server computers 
whose machine addresses are associated with 
that generic name, the decision logic employing 
different sets of predetermined test criteria for 
each generic name. 

8. A system as claimed in Claim 7, wherein one or 
more of the server computers are associated with 
a plurality of the generic names. 

9. A method of operating a data processing system 
to facilitate a connection of a program on a client 
computer to a server, the server consisting of a 



plurality of server computers with shared re- 
sources, the data processing system, the client 
computer, and the server computers residing in a 
network, the method comprising the steps of: 

(a) receiving a requestfrom the client comput- 
er for a machine address of a server computer 
identified by a server computer name sent 
with the request, such a machine address 
enabling a connection to be made from the cli- 
ent computer to that server computer via the 
network; 

(b) storing a list identifying server computer 
names with machine addresses of the server 
computers in a storage device; 

(c) converting, with reference to the list, the 
server computer name received at step (a) 
into the machine address of the server com- 
puter; 

(d) sending the machine address identified at 
step (c) to the client computer; 

the method being characterised by the steps 
of: 

(e) employing decision logic to study the ser- 
ver computers at predetermined intervals 
having regard to predetermined test criteria, in 
order to select one of the server computers; 
and 

(f) updating the list by associating the ma- 
chine address for the server computer select- 
ed by the decision logic with a particular ser- 
ver computer name contained as a generic 
server computer name in the list; 

whereby when a client computer specifies the 
generic server computer name at step (a), it re- 
ceives the machine address of the server com- 
puter identified by the decision logic. 

10. A method as claimed in Claim 9 further compris- 
ing the steps of: 

copying the list from the storage device to 
a piece of memory accessible at the conversion 
step (c); 

repeating, in response to the updating of 
the list at step (f), the copying step to ensure that 



45 the updated list is copied into the piece of local 
memory. 

11. A method as as claimed in claim 9 or claim 10, 
wherein the predetermined intervals are set by a 

50 user of the system. 

12. A method as claimed in any of claims 9 to 11, 
wherein the predetermined test criteria are set by 
a user of the system. 

55 

13. A method as claimed in any of claims 9 to 12, 
wherein the user can limit the number of server 
computers which the decision logic studies. 
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14. A method as claimed in any of claims 9 to 13, 
wherein a plurality of generic names are used, 
each one having a number of server computers 
whose machine addresses are associated with 
that generic name, the decision logic employing 5 
different sets of predetermined test criteria for 
each generic name. 

15. A method as claimed in Claim 14, wherein one or 
more of the server computers are associated with w 
a plurality of the generic names. 
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